Probabilistic Fault Diagnosis Using Adaptive Probing
Authors
Abstract
Past research on probing-based network monitoring has produced solutions based on preplanned probing, which is computationally expensive, less accurate, and generates a large amount of management traffic. Unlike preplanned probing, adaptive probing selects probes interactively, sending more probes to diagnose observed problem areas and fewer probes to healthy areas, thereby significantly reducing the number of probes required. Another limitation of most past work is that it assumes deterministic dependency information between the probes and the network components. Such an assumption cannot be made when complete and accurate network information is unavailable. Hence, there is a need for network monitoring algorithms that can localize failures even when the inferred dependencies between probes and network components are uncertain. In this paper, we propose a fault diagnosis tool with the following novel features: (1) We present an adaptive probing based solution for fault diagnosis which is cost-effective, failure resistant, more accurate, and involves less management traffic than the preplanned probing approach. (2) We address the issues that arise in a non-deterministic environment and present probing algorithms that consider the involved uncertainties in the collected network
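The idea of adaptive probing under uncertain dependencies can be illustrated with a minimal sketch. This is not the algorithm from the paper; it assumes at most one failed node, a probe that fails exactly when it traverses the failed node, and a greedy rule that picks the probe with the lowest expected posterior entropy. The names `update`, `expected_entropy`, `diagnose`, and `send_probe` are hypothetical and introduced only for illustration.

```python
import math


def entropy(belief):
    """Shannon entropy (bits) of a belief over fault hypotheses."""
    return -sum(p * math.log2(p) for p in belief.values() if p > 0)


def update(belief, probe, failed):
    """Bayes update of the belief after observing one probe result.

    belief: hypothesis -> probability; the hypothesis None means "no fault".
    probe:  node -> probability that the probe traverses that node
            (the uncertain dependency; an assumption of this sketch).
    """
    posterior = {}
    for hyp, prior in belief.items():
        p_fail = probe.get(hyp, 0.0)  # a probe cannot fail if nothing is faulty
        posterior[hyp] = prior * (p_fail if failed else 1.0 - p_fail)
    total = sum(posterior.values())
    if total == 0:
        return dict(belief)  # observation impossible under the model; keep belief
    return {h: p / total for h, p in posterior.items()}


def expected_entropy(belief, probe):
    """Expected entropy of the belief after sending the given probe."""
    p_fail = sum(prior * probe.get(h, 0.0) for h, prior in belief.items())
    result = 0.0
    for failed, p_obs in ((True, p_fail), (False, 1.0 - p_fail)):
        if p_obs > 0:
            result += p_obs * entropy(update(belief, probe, failed))
    return result


def diagnose(belief, probes, send_probe, max_probes=10, threshold=0.95):
    """Greedily send the most informative probe until one hypothesis dominates.

    send_probe(name) is a caller-supplied function returning True on failure.
    Returns the final belief and the list of probe names actually sent.
    """
    remaining = dict(probes)
    sent = []
    while remaining and len(sent) < max_probes and max(belief.values()) < threshold:
        name = min(remaining, key=lambda n: expected_entropy(belief, remaining[n]))
        belief = update(belief, remaining.pop(name), send_probe(name))
        sent.append(name)
    return belief, sent
```

Each probe is chosen using the results of the previous ones, so probes that cannot change the current belief are never sent; a preplanned approach would have to send the whole probe set up front.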
Similar resources
Problem Diagnosis in Distributed Systems using Active Probing
As distributed systems continue to grow in size and complexity, scalable and cost-effective techniques are needed for performing tasks such as problem determination and fault diagnosis. We address these tasks using probes, or end-to-end test transactions, which gather information about system components (e.g., using IBM’s EPP technology). Effective probing requires minimizing the cost of probin...
Efficient probe selection algorithms for fault diagnosis
The growing use of networks for performance-critical applications has created a demand for tools that can monitor network health with minimum management traffic. Adaptive probing has the potential to provide effective tools for end-to-end monitoring and fault diagnosis over a network. Adaptive probing based algorithms adapt the probe set to localize faults in the network by sendi...
Using Adaptive Probing for Real-Time Problem Diagnosis in Distributed Computer Systems
In this work, we focus on cost-efficient techniques for real-time diagnosis in distributed systems that allow an adaptive, on-line selection and execution of appropriate measurements (tests). In particular, one of our applications concerns fault diagnosis in distributed computer systems and networks by using test transactions, or probes (e.g., "traceroute" or "ping" commands). The key efficiency ...
Efficient fault diagnosis using probing
In this paper, we address the problem of efficient diagnosis in real-time systems capable of on-line information gathering, such as sending "probes" (i.e., test transactions, such as "traceroute" or "ping") in order to identify network faults and evaluate performance of distributed computer systems. We use a Bayesian network to model probabilistic relations between the problems (faults, perform...
Adaptive Probing: A Monitoring-Based Probing Approach for Fault Localization in Networks
Past research on fault localization in distributed data centers has used either monitoring or probing techniques in isolation. In this paper, we argue that effective fault localization solutions can be built by exploiting the information captured by both techniques. Based on this concept, we propose an adaptive probing solution for fault localization where information from monitoring ag...